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DELIVERY OF TREFOIL PEPTIDES 

The present invention relates to the field of in vivo protein delivery systems. 
More particularly, the present invention relates to the secretion in vivo of trefoil 
5 peptides by micro-organisms, preferably bacterial strains, preferably non- 
pathogenic strains, preferably non-invasive strains, preferably food grade strains, 
methods for delivering trefoil peptides using said systems and the use of said 
trefoil peptide expression systems for treatment of inflammatory disorders of the 
gastro-intestinal tract. 

10 Lactococcus lactis is a Gram-positive non-pathogenic lactic acid bacterium 

which can survive in the intestine (Klijn et a/., 1995). It is not certain whether L. 
lactis can also be metabolically active in all of these environments. 

The expression of tetanus toxin fragment C by Lactococcus lactis in view of 
vaccination was described by Wells et al. (1993b) and Robinson et ah (1997). 

15 Further, it was demonstrated that when preparations of L. lactis bacteria 
engineered to express either lnterieukin-2 or lnlerieukln-6 together with tetanus 
toxin fragment C (TTFC) were administered Intranasally to mice, more than 10 
times more anti-TTFC was produced than after similar administration of strains 
expressing TTFC alone (International patent application published under WO 

20 97/14806). These results prove the use of a cytokine-secreting, non-invasive 
experimental bacterial vaccine vector to enhance immune responses to a co- 
expressed antigen. 

Trefoil peptides are secreted by epithelial mucus cells and are stable in an 
acid environment. These peptides contribute to the protection of the mucosa 

25 (formation of a gel over the epithelium) and are probably involved in the repair of 
damaged mucosa by stimulation of epithelial migration (Playford ef a/., 1996). The 
production of trefoil peptides increases locally on places where damage occurs 
such as gastric ulcers and colitis (Wright ef a/., 1990). Babyatsky ef a/. (1996) 
have shown that the administration of recombinant trefoil peptides reduces the 

30 damage at those places. In contradiction with most other proteins that are 
important for the protection of the mucosa (such as epidermal growth factor), most 
studies have demonstrated that trefoil peptides cause little or no proliferation, and 
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that they might even inhibit proliferation (Playford etal., 1996). Three mennbers of 
this family of trefoil peptides have been identified in humans and originally 
designated: pS2 (breast cancer oestrogen inducible gene), SP (spasmolytic 
peptide)^ and ITF (intestinal trefoil factor). In the present nomenclature pS2 is 
5 renamed as TFF1, SP as TFF2 and ITF as TFF3 (see e.g.^Wong ef a/., 1999). 
This new nomenclature will be used throughout the present text. 

Both in humans, mice and rat TFF1 and TFF2 are predominantly found in 
the stomach while TFF3 is predominantly found in the duodenum and colon. 
Wong et al. (1999) give a recent overview of trefoil peptides. The contents of this 

10 article are incorporated by reference in the present disclosure. 

The use of trefoil proteins or peptides for treating of disorders of and 
damage to the alimentary canal, including the mouth, oesophagus, stomach, and 
large and small intestine, as well as for the protection and treatment of tissues that 
lie outside the alimentary canal are .described In WO 97/38f 12. These proteins 

15 can be^us.edaeithere tostreatsslesioRS-in theseirfareaswr^rto^inhibit the formation of 
lesions. These lesions 'can be caused bywadiation therapy or chemotherapy for 
the treatment of canceri; any other-^'drugdncludingE^alcoholstwhich damages the 
alimentary- canal- aecidentaii exposu re-^to radiation or tea "'caustic substance, 
infection,, a digestive disorder includiing4JUt^:not-limited to non-ulcer dyspepsia, 

20 gastritis, peptic or duodenal ulcer, gastric cancer, MALT lymphoma, Menetier's 
syndrome, gastro-oesophageal reflux disease, Crohn's disease, ulcerative colitis 
and acute colitis of chemical, bacterial or obscure origin. 

Trefoil peptides are particularly useful to treat acute colitis. 

ITF has also been used in combination with EGF (epidermal growth factor) 

25 for treating gastro-intestinal tract ulcers. In vitro and in vivo experiments have 
shown that the wound healing activities of "EGF are**markedly increased by 
treatment of EGF in combination with^HTF, without increasing the proliferative 
action of EGF (Chinery and Playford«>n.1 99&): 

Inflammatory bowel disease is the group name, for a range of gastro- 

30 intestinal inflammations. Belonging to this group are enteritis, colitis, inflammations 
of respectively the mucosa of the duodenum or the colon. Crohn's disease 
(enteritis regionalis) and ulcerative colitis (colitis ulcerosa) are closely related, 
chronic and spontaneously recurring diseases of the gastro-intestinal tract. These 



diseases are immunologically mediated and have environmental and genetic 
causes. Sartor (1995) describes the different aspects of Inflammatory bowel 
disease. Crohn's disease has been particularly studied by for instance Herfath and 
Sartor, (1994), Cominelll etal. (1994), and MacDermott (1989). 
5 The aim of the present invention is to provide a method for delivering trefoil 

peptides to treat gastro-lntestinal disorders. 

Another aim of the present Invention is to provide a pharmaceutical 
composition for treating gastro-intestinal disorders. 

The present invention relates more particularly to a micro-organism 

10 delivering a trefoil peptide in vivo. Preferentially said micro-organism Is a bacterial 
strain, preferably a non-pathogenic strain, preferably a non-invasive strain, 
preferably a food grade strain, more preferably a gram-positive bacterial strain, 
most preferably a lactic acid fermenting bacterial strain, preferably a Lactococcus 
or a Lactobacillus species expressing a trefoil peptide in vivo. 

15 It was not obvious from the capacity of Lactococcus iactis to deliver one 

heterologous antigen or its ability to produce molecules such as IL-2 and IL-6 in 
vitro and in vivo that bacteria would be an appropriate vehicle for delivery of other 
types of peptides or polypeptides in vivo. Further It is unknown whether said trefoil 
peptides expressed by said bacterial strains will provide a beneficial effect to 

20 inflammatory diseases of the gastro-lntestinal tract, such as Inflammatory bowel 
disease or acute colitis. 

It is, therefore, surprising that It could be demonstrated in the present 
Examples section that bacterial strains are able to express trefoil peptides in vivo 
when present in the gastro-intestinal canal and exert a healing effect in acute 

25 colitis situations. By way of example, PGR fragments containing the coding region 
of rat TFF3 and mouse TFF1 were cloned. Recombinant vectors comprising these 
PGR clones under the control of a promoter and the usp45 Lactococcus Iactis 
secretion signal sequence were constructed. Four transformed Lactococcus Iactis 
strains were constructed which express either rat TFF3 or mouse TFF1 trefoil 

30 peptides. It was further shown in an in vivo mice model system that both 
recombinant rTFF3 and recombinant mTFFI produced by these bacteria can 
surprisingly exert healing effects on the distal part of the inflamed colon. 
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According to a preferred embodiment, the present invention relates 
particularly to a bacterial strain delivering intestinal trefoil polypeptide (TFF3) in 
vivo. 

According to another preferred embodiment, the present invention relates 
5 to a bacterium delivering TFF1 in vivOi 

It is to be understood that the present invention also relates to micro- 
organisms parts or variants of any trefoil peptide. Said parts refer to biologically 
active parts which can be generated by methods known to those skilled in the art. 
These parts will generally contain at least 10 contiguous amino acids, typically at 
10 least 20 contiguous amino acids, more typically at least 30 contiguous amino 
acids, usually at least 40 contiguous amino acids, and preferably at least 50 
contiguous amino acids. Said variants refer to variants which have the same 
biological activity as the above mentioned trefoil peptides. 

It should also *be clear that bacterial strains according to the present 
1 5 invention>/as ?de«inedwabove, .may^ also express^additionabrecombinant proteins 
which are^beneficialito^the treatment of:rany envisaged disordan^ 

According , to *an©ther^embodii5nent, .the«t present^ invention relates to a 
method for the-delivery of trefoil peptide to the gastro-intestinal tract comprising 
the administration of a micro-organism as defined^above. 
20 According to another aspect, the present invention also relates to the use of 

a micro-organism as defined above for the manufacture of an agent for the 
delivery of trefoil peptide to the gastro-intestinai tract. 

According to yet another embodiment, the present invention relates to a 
pharmaceutical composition comprising a micro-organism expressing a trefoil 
25 peptide as defined above. 

Advantageously, the pharmaceutical composition according to the present 
invention is preferably suitable for application to the mucosaksurfaces. 

Pharmaceutical compositions according to :th(B» present -invention, and for 
use in accordance to the present invention, may comprise, in addition to the 
30 micro-organism, a pharmaceutically acceptable excipient, carrier, buffer, stabiliser 
or other materials well known to those skilled in the art. Such materials should be 
non-toxic and should not interfere with the efficacy of the active ingredient. The 
precise nature of the carrier or other material may depend on the route of 



5 

administration. Those of relevant skill in the art are well able to prepare suitable 
solutions. 

According to another embodiment, the present invention relates to a 
method of treatment of gastro-intestinal inflammatory diseases comprising 
5 administration of a micro-organism delivering a trefoil peptide in vivo as defined 
above. 

According to a preferred embodiment, the present invention relates to a 
method of treatment of gastro-intestinal inflammatory diseases comprising 
administration of a micro-organism delivering a TFF1 trefoil peptide in vivo. 

10 According to another preferred embodiment, the present invention relates 

to a method of treatment of gastro-intestinal inflammatory diseases comprising 
administration of a micro-organism delivering a TFF3 trefoil peptide in vivo. 

Administration of the micro-organism may be orally or by means of any 
other method known in the art allowing the micro-organism to enter the desired 

15 places to be treated, such as e.g. anal, vaginal. The micro-organism may be 
applied in a nutrient medium, i.e. a medium containing a substance or substances 
which sustain (at least in vitro) metabolic activity of the micro-organism. Such 
substances may sustain viability if not growth of the micro-organism. Such 
substances may include an energy source such as glucose, amino acids and so 

20 on. 

The individual to which the micro-organism is administrated may be a 

human or an animal. 

In a therapeutic context, i.e. where the biological effect of delivery of the 

polypeptide to an individual is beneficial to that individual, administration is 
25 preferably in a 'therapeutically effective amount', this being sufficient to show 

benefit to the patient. Such benefit may be at least amelioration of one symptom. 

The actual amount administered, and rate and time-course of administration, will 

depend on the aim of the administration, e.g. the biological effect sought in view of 

the nature and severity of the challenge and is the subject of routine optimisation. 
30 Prescriptions of treatment, for example decisions on dosage etc, is within the 

responsibility of general practitioners and other medical doctors. 
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A composition comprising micro-organisms according to the present 
Invention may be administered in accordance with the present invention alone or 
in combination with other treatments, either simultaneously or sequentially. 

According to another embodiment, the present invention relates to the use 
5 of a micro-organism expressing a trefoil peptide as defined above for the 
preparation of a medicament for the treatment of acute phases of diseases of the 
gastro-intestinal tract or diseases involving lesions at mucosal surfaces. 

These diseases are preferably acute gastro-intestinal disorders of 
chemical, bacterial or obscure origin. Belonging to this group are enteritis, colitis, 

1 0 inflammations of, respectively, the mucosa of the duodenum or the colon. Also 
included is traveller's disease. Crohn's disease (enteritis regionalis) and ulcerative 
colitis (colitis ulcerosa) are closely related, chronic and spontaneously recurring 
diseases of the gastro-intestinal tract. 

The disease states to be treated can also include disorders of and damage 

15 to the alimerntaiyAcaRaly including the mouth, oesGphagys, stomach, and large and 
small intestine,aas well as for. the protection and treatment of tissues that lie 
outside the alimentany canal. The trefoil proteins expressed by the bacterial strains 
according to the present invention can be used either to treat lesions in these 
areas or to inhibit the formation of lesions. These lesions can be caused by: 

20 radiation therapy or chemotherapy for the treatment of cancer, any other drug 
including alcohol which damages the alimentary canal, accidental exposure to 
radiation or to a caustic substance, Infection, a digestive disorder including but not 
limited to non-ulcer dyspepsia, gastritis, peptic or duodenal ulcer, gastric cancer, 
MALT lymphoma, Menetier's syndrome, gastro-oesophageal reflux disease, and 

25 Crohn's disease. 

According to a preferred embodiment, the present invention relates to a 
bacterial strain delivering a trefoil peptide as defined above for use in the 
preparation of a medicament for treating acute colitis, including but not limited to 
acute flare-ups in Crohn's disease and ulcerative colitis. 

30 According to another embodiment, the present invention relates to a 

method for producing a micro-organism delivering a trefoil peptide in vivo as 
defined above comprising transforming a micro-organism with a recombinant 
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vector carrying a trefoil polypeptide coding sequence under the control of a 
suitable promoter and a suitable bacterial secretion signal sequence. 

Said bacterial secretion signal sequence can be any sequence known in 
the art to perform said function. Preferably, for L lactis said secretion signal is the 
5 usp45 L lactis secretion signal sequence. Said promoter sequence can be any 
promoter allowing expression of said coding sequence in said n)icro-organism. 
Examples given in the examples section include the known Inducible E. coli phage 
T7 promoter and the known constitutive P1 promoter of L. lactis. 

The present invention also relates to a recombinant vector comprising at 
10 least part of trefoil peptide coding sequence under the control of a suitable 
promoter and a suitable secretion signal sequence. Said recombinant vector can 
be used to deliver in vivo at least part of a trefoil peptide sequence which can 
exert on healing effect on damaged areas of the mucosal surfaces. 

The following examples merely serve to illustrate the present invention, and 
15 are not to be construed as limiting the invention in any way. 

All documents mentioned in this text are incorporated by reference. 
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FIGURE LEGENDS 

Figure 1: Overview of the plasmids used. 

Figure la : Schematic maps of the plasmids pL2rTFF3, pL2mTFF1v1, pT1rTFF3 
and- pT1 mTFF1 . T7 is the major late promoter.- from coliphage T7 
(Studier and Moffatt, 1986). PI is the lactococcal promoter as in 
Waterfield et ai, (1995), usp45S is a DNA fragment encoding the 
secretion signal peptide from the lactococal Usp45 protein (van 
Asseldonck ef a/., 1990), rtff3 is a DNA fragment encoding the mature 
part of rat TFF3, mtffi Is a DNA fragment encoding the mature part of 
murine TFF1, mtffi v1 is a DNA fragment encoding a truncated 
(missing two aminotermlnal aa residues) mature murine TFF1 , Cm is 
the chloramphenicol selection marker, Em is the erythromycin 
sejectiopvmarker. 

F/^ure Jtos*DISJAirsequ,ernGe«0f^plasmid*pL2riiFFF3-(:SEQJD,N®w^ 

Figure *?c»:..DNAtse.qujBmce of plasmidipb2r*il"FF1 vl". (SEQ.ID:N0 2). 

Figure tdsaFDNi^^ sequence of plasmid5pT1rTFF3 .(SEQ;ID;.N© 3). 

Figure let Dtf A sequence of plasmid pTImTFFT (SEQ ID N© 4). 

Figure 2: SDS-polyacrylamlde gel electrophoresis. The different protein fractions 
are derived from the medium of L lactis MG1820 [pILPOL] (control), 
MG1820 [pILPOL; pL2mTFF1v1] or MG1820 [pILPOL; pL2rTFF3] 
cells. The left lane contains marker proteins wherein the molecular 
weight is given In kDa. The proteins were visualised using Coomassle 
Blue staining. 

Figure 3: Representation of the histological scores of the middle part of the 
colon. Top-left hand side-^gr^phicv epitheliwiin^damage (middle part 
colon). Top right hand side graphic: inflammatory infiltration (middle 
part colon). Bottom graphic: sum of the histological scores of the top 
graphics (middle part colon). 
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Figure 4: Representation of the histological scores of the distal part of the colon. 

Top left hand side graphic: epithelium damage (distal part colon). Top 
right hand side graphic: inflammatory infiltration (distal part colon). 
Bottom graphic: sum of the histological scores of the top graphics 
(distal part colon). 
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Description of the general methods used in this invention 
Culture media 

GM17 is Ml 7 (Difco, Detroit) supplemented with 0.5 w/v % of glucose. M9 
medium contains per litre: 6g of Na2HP04, 3 g of KH2PO4, 1 g of NH4CI, 0.5 g of 
NaCI, 2 mmol of MgS04, 0.1 mmol of CaCle and 5 g of Casitone (Difco), M9B is 
M9 supplemented with 2.1 g of NaHCOa and 2.65 g of NaaCOa per liter.' GM9B is 
M9B supplemented with 0.5 w/v % of glucose. LM9B is M9B supplemented with 
0.5 w/v % of lactose. 

When appropriate the antibiotics, erythromycin (Er) or chloramphenicol 
(Cm), were added to the respective media at final concentrations of 5 jig/ml each. 
The designation used to indicate the presence of antibiotic is, e.g. GI\417Er, 
LM9BCm and so on. Solid media contained 1.2 % agar. 

Recombinant DMA techniques 

DNA modifying enzymes and restriction endonucleases were used under 
standard conditions and in the buffers recommended by the manufacturers. 
30 General molecular cloning techniques and the electrophoresis of DNA and 
proteins were carried out according to standard procedures. L lactis was 
transformed by electroporation of cells grown in the presence of glycin (Wells et 
aL, 1993a), Plasmid DNA was routinely purified using the Qiagen Plasmid Kit 
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PCR amplification of mouse TFF1 (mTFFI) 

The PCR reaction was carried out on plasmid pGEMmpS2 (kindly provided 
by Dr. Marie-Christine RIO, Institut de G6n6tique- et Biologie Moleculaire et 
5 Cellulaire, lllkirch Cedex, France) using the ^oligonucleotide primers mTFFI S and 
mTFFI A. The pGEMmpS2 plasmid was first digested with H/ndlll to obtain a 
linear fragment. The mTFFI S primer corresponds^to the first 18 nucleotides of the 
sense strand of mTFFI from the first nucleotide behind the signal sequence. The 
mTFFI A primer is complementary to the last 26 nucleotides of the sense strand of 
10 mTFFI including the stop codon, and introduces an extra Spel restriction site. 

mTFFI S: 5'-CAGGCCCAGCCCAGGCC -3' ((SEQ ID NO 5) 
mTFFI A: 5'-GCACTAGTTAGAAGGGACATTCTTCTTCTTG AG-3' (SEQ 
I D NO 6) wherein AGTAGl in mTFFI A repjesentSian'^SjErek^^^^ 

15 

P@B lanapjlfiiatioia wasr carnied^0UiUusii3gi^ehtS^**bNA^^^ polymerase (New 
England Biolabs^Biverly, USA)^whiibbrgiyeswa^PGR productacarrying blunt ends. 
The PCR mixture consisted of*2 unitstWent DNA polymerase, lOpI Vent buffer 
(thermopol), 4|j| dXTP's (O.SmM-^rnaximam), Spl {0.5|jM?>of-'©ach primer, Ipl (50 
20 ng) template DNA and 74pl H2O. Six reactions were set up differing in their final 
concentration of MgS04, adjusted to 0, 1, 2, 3, 4 and 5 mM respectively. PCR 
amplification cycles were: To for 300" at 94^C, Ti for 45" at 94°C, T2 for 30" at H 
60*^0, T3 for 20" at 72''C, T4 for 10" at 20*^0. These cycles Ti until T3 were carried 
out 30 times. 

25 PCR amplification with these primers rendered the gene for mature mTFFI 
lacking the signal sequence and including an additional Spel restriction site. After 
checking by gel electrophoresis, the amplified fragment appeared as a band in the 
expected length range;.. The 5' end of. the^mTFFI sequenceis^contains two possible 
target seqweRees-^complementary^to-the'^fow^ consequence two 

30 fragments of 202 base pairs and 208 base pairs respectively can be amplified 
from the mTFFI cDNA by use of the mentioned primers. These fragments are not 
expected to be resolved by agarose gel electrophoresis. 
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PCR amplification of rat TFF3 (rTFF3) 

The PCR reaction was carried out on plasmid pBluerlTF (kindly provided by 
Dr. Daniel K. Podolsl<y, Massachusetts General Hospital, Boston, USA) using the 
oligonucleotide primers rTFF3S and rTFF3A. The rTFF3S primer corresponds to 
the first 23 nucleotides of the sense strand of rTFF3 from the first nucleotide 
behind the signal sequence. The rTFF3A primer is complementary to_tbe_last-23 
nucleotides of the sense strand of rTFF3 including the stop codon, and introduces 
an extra Spel restriction site. 

rTFF3S: 5'-CAGGAATTTGTTGGCCTATCTCC-3' (SEQ ID NO 7) 
rTFF3A: 5'-CGACTAGTTAAAATGTACATTCTGTCTCTTGCA-3* (SEQ ID 
NO 8) wherein ACTAGT in rTFFSA represents an Spel site 

15 The trinucleotide TTA in the antisense primer introduces a TAA stop codon 

which is different from the wild-type rTFF3 stop codon (TGA). PCR amplification 
conditions were as described before under "PCR amplification of mouse TFF1 
(mTFFI)". 

PCR amplification with these primers rendered the gene for mature rTFF3 
20 lacking the signal sequence, with an altered stop codon and including an 
additional Spel restriction site. Following agarose gel electrophoresis and EtBr 
staining a band having the expected size range (187 bp) was detected. 



Construction ofplasmlds 

25 

Two different types of vectors were used as acceptors for either the mTFFI 
or rTFFS trefoil peptide encoding PCR fragments. The primary structure of the two 
parental vectors - pT1NX, derived from pTREXI (Wells and Schofield, 1996), and 
pLET2NX, derived from pLET2N (Steidler et al., 1995) - contains the following 
30 common elements; a promoter (T7 or PI), the LJactis usp45 secretion signal 
sequence (van Asseldonk et al., 1990 and European patent application published 
under No. 0 455 280), modified to contain a Nae\ restriction site overlapping the 
sequence encoding the ultimate aa residue (Steidler et al., 1995), and a 



12 



downstream Spel restriction site. pTINX derived plasmids specify resistance to 
erythromycin; pLET2NX derived plasmids specify resistance to chloramphenicol. 
The PGR fragments were treated for 1 hour at 37°C using SOpI DNA solution, lOpI 
Spel-buffer, 50 units Spel. 10 units- T4 polynucleotide kinase (GIbco BRL, 

5 Bethesda, USA), 0.5 mM^vATP, adjusted to pH 7.5, and 3ep\ HgO. The vector 
pT1NX was digested for 1 hour at 37°C using 10 a 20pl purified DNA, lOpI Nae\ 
buffer, 10 units Nae\, 50 units Spel, 1 unit calf Intestine alkaline phosphatase 
(Boehrlnger, Mannheim, Germany) and 73 h 63pl H2O. After 30 minutes 
Incubation, 50 units of Spel and 10 units of A/ael were again added to the mixture. 

10 The restriction enzymes were Inactivated and extracted from the mixture by 
phenol/chloroform extraction. After restriction digestion, the mTFFI -derived band 
(comprising a 195 bp and a 201 bp fragment as described before under "PGR 
amplification of mouse TFF1 (mTFFI)", the 180 bp rTFF3 - derived fragment and 
the vector parts were excised from -the agarose gel. Following ligation of the 

1 5 respective«P®l3-^fragpernts«aindith'e*ve©t0r«for*«i45^ijninutes«at»1 6%G using "Ready To 
Go" T4 DNA-*ligase«(«Pl^arrnacia BioteGh, UK^srecombinant^plasmids were obtained 
containing4he-mT'FF1'andKthe>irTFF3 cistr©iasi!<as*an in-fraiine«fusion to the usp45 
secretion signal sequence under the control of the promoter. 

The plasiinids^pTI mTFFI and-pTI rTF-FS -(Figu:ne*«1 a), which contain the 

20 constitutive L. lactis PI promoter, resulted from ligation of the purified A/ael - Spel 
vector part of pTI NX and the Spel cut and 5' phosphorylated PGR fragments. 

The plasmids pL2mTFF1v1 and pL2rTFF3 (Figurela), which dontain the 
inducible E. coli phage T7 promoter, resulted from ligation of the purified A/ael - 
Spel vector part of pLET2N and the Spel cut and 5' phosphorylated PGR 

25 fragments. The T7 promoter can only be activated by the cognate T7 RNA 
polymerase encoded by e.g. plasmid pILPOL. TRIs plasmid is present In L. lactis 
strain MG1820 [pILPOL] (Wells ef a/., 1993c): 

For structural analysis plasmidspTlmTFFil and'!pTTrTFF3 were transformed 
into L lactis strain MG1363. The cells were grown on GM.17Er plates. Colonies 

30 were grown in 2.5 ml GM17Er and the plasmids were isolated. By means of an 
analytical digest, the restriction pattern of the pTI NX vector (2mI DNA (pTINX), 20 
units EcoRI, 50 units Spel. 2pl Spel-buffer and 15|jI H2O) and the isolated 
recombinant plasmids (5^1 DNA, 20 units EcoRI, 50 units Spel, 2pl Spel-buffer, 
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0.25 pi of a 10 pg/ml Rnase A stock solution, 12pl H2O) were compared. The 
plasmids were cut with EcoR\ and Spe\ for 1h at SV'C. In the reference plasmids, 
two linear fragments of 907bp and 4999bp are predicted. In pTImTFFI, two 
bands of 499 bp and 4999 bp are predicted. In pT1rTFF3, two bands of 478 bp 
5 and 4999 bp are predicted. The sizes of the experimentally obtained fragments, 
as visualized by agarose gel electrophoresis and EtBr staining, were consistent 
with the predicted lengths. From each recombinant plasmid, one positive culture 
was striked out on GM17Er plates to obtain isolated colonies. One colony was 
subsequently inoculated in 100 ml GM17Er medium and grown to saturation. The 

10 cells were collected and the plasmids were purified. Their physical structure was 
verified by restriction enzyme analysis and agarose gel electrophoresis. In 
addition, sequence analysis revealed that the mTFF1 and rTFF3 cistrons had 
been ligated perfectly in frame with the usp45 secretion leader sequence. 
pTImTFFI contains a 208 bp insert which represents the complete coding 

15 sequence of mature mTFFI (as described before under "PGR amplification of 
mouse TFF1 (mTFFI)"). pT1rTFF3 contains a 180 bp insert encompassing the 
complete coding sequence of mature rTFF3. 

For structural analysis plasmids pL2mTFF1v1 and pL2rTFF3 were 
transformed into strain MG1820[plLPOL]. The cells were grown on GM17Cm 

20 plates. Colonies were grown in 2.5 ml GM17Cm and the plasmids were isolated. 
By means of an analytical digest, the restriction pattern of the pLET2NX vector 
(2pl DNA (pLET2NX), 20 units EcoRI, 50 units Spel, 2pl Spel-buffer and 15pl 
H2O) and the isolated recombinant plasmids (5pl DNA, 20 units EcoRI, 50 units 
Spel, 2pl Spel-buffer, 0.25 pi of a 10 pg/ml Rnase A stock solution, 12pl H2O) 

25 were compared. The recombinant plasmids were cut with EcoRI and Spel for 1 h 
at 37''C. In the reference plasmids, two linear fragments of 907bp and 4650bp are 
predicted. In pL2mTFF1, two bands of 499 bp and 4650 bp are predicted. In 
pL2rTFF3, two bands of 478 bp and 4999 bp are predicted. The sizes of the 
experimentally obtained fragments, as visualized by agarose gel electrophoresis 

30 and EtBr staining, were consistent with the predicted lengths. From each 
recombinant plasmid, one positive culture was striked out on GM17Cm plates to 
obtain isolated colonies. One colony was subsequently inoculated in 100 ml 
GM17Cm medium and grown to saturation. The cells were collected and the 
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plasmlds were purified. Their physical structure was verified by restriction enzyme 
analysis and agarose gel electrophoresis. In addition, sequence analysis revealed 
that the mTFFI and rTFF3 cistrons had been ligated in frame with the usp45 
secretion leader sequence. We analysis-^ further showed that pL2mTFF1v1 
5 contains a 202 bp Insert (consequently missing the first two aminoterminal aa 
residues of mature mTFFI ; as described before under "PGR amplification of 
mouse TFF1 (mTFFI)". pL2rTFF3 contains a 180 bp insert encompassing the 
complete coding sequence of mature rTFF3. 

The sequences of the recombinant plasmids are given in figures lb to 1e. 
10 Their complete sequences were compiled from the published sequences of the 

constituting parts. In addition, relevant sections of the sequences such as PGR " 
fragments and ligation junction points were experimentally verified. 

Protein expression in transformed- L. lactis 

15 

L../a&f/;skstralras' were tramfoteedrwithltheMplasiiiiiidssasiiConstructed above. 
For transformation .of*theip'P1'miFFR«^and4ti6*pT1 rTPFS plasmidSj L. lactis strain 
MG1363 (Glsson, 1983)*was'^^ used. FOrstransformatiomof*'tR&' pL2mTFF1v1 and 
the pL2rTFF3 plasmlds,- L. lactis strain MQ-1 820 (pILPOL) (Maeda and Gasson, 
20 1986) was used. 

The expression of the proteins by these transformed L. lactis strains was 
detected by SDS-PAGE. 9 

To prepare culture supernatant fractions, the cells were grown for 20 hours 
at 28°C in five ml GM17Er medium for pTI-type plasmids or GM17Cm medium for 
25 pL2-type plasmids. The cultures were diluted 1/100 in five ml of either GM17Er or 
GM17Cm medium and grown for 3 hours at 28°C. The cells were collected by 
centrifugation- at 2800* rpm for 20 min-and> resuspendedv in five ml of the 
appropriate., medium, i.e.t GMSBEr for»MG:i363 cellSi*or-LM9BCm for MG1820 
[pILPOL] cells** After-^a further»^five«hours-®f'*gr0wt*w»ithe'^celljs*were»*pelleted". The 
30 proteins present in the medium fractions were recovered by phenol extraction and 
ethanol precipitation. 

The proteins expressed in the culture supernatant fraction of a L. lactis 
MG1820 control strain compared to L lactis MG1820 strains transformed with 
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[pILPOL; pL2mTFF1v1l or [pILPOL; pL2rTFF3] are shown in Figure 2. As can be 
observed from this figure, the expression of the recombinant genes is quite low. 
This renders the observed in vivo result surprising since others use purified trefoil 
peptides in therapies for the repair of gastric and intestinal injury at dramatically 
5 higher levels; e.g. Tran et al. (1999) used daily Intrarectal application of human 
recombinant TTF2 at levels of 2.5 mg/kg body weight for five days to obtain a 
reduction In the inflammatory Index of experimentally Installed colitis in rats 
(Intracolonic administration of dinitrobenzene sulphonic acid in alcohol). 



1 0 Preparation of cells for oral administration 

Transformants of L. lactis strains, MG1363 [pTREXI], MG1363 
[pTImTFFI] and MG1363 [pTIrTFFS] were streaked on GM17Er plates and 
grown overnight at 28°C. In each case a single colony was subsequently grown 

15 overnight at 28"C in 15 ml GM17Er medium. To this culture, 15 ml 100% glycerol 
was added in order to preserve said cells at -20**C. Each day, the" necessary 
amount of cells could be inoculated for treatment of mice. To this end the culture 
was diluted 1/200 into 10 ml GM17Er medium. After minimum 20 hours of growth 
at SO'C, the cells were collected by centrlfugation for 15 min at 2800 rpm. The 

20 ceils were then resuspended in 1 ml M9B without antibiotic. 

From this culture 100 \i\ (containing a minimum of 1.10® cells) were 
administrated intragastric per mouse. 



In vivo tests In mice with acute colitis 

25 

The effect of the trefoil peptides expressed from these L. lactis bacteria 
was tested out in mtee suffering from acute colitis. Twenty-one female Balb/c mice 
received 5% DSS (dextrane sodium sulphate) dissolved in their drinking water 
during 7 days. In this manner, acute colitis was induced (Kojouharoff etal., 1997). 
30 For therapeutic purposes these mice were orally inoculated by means of a gastric 
catheter using lOOpI bacterial suspension (minimum 1.10® cells) from day 1 until 
day 7 of the DSS treatment. Six mice were inoculated with MG1363 [pTREXI] 
cells (group 1), six mice were inoculated with MG1363 [pTImTFFI] cells (group 2) 
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and three mice were Inoculated with MG1363 [pT1rTFF3] cells (group 3) and three 
mice were not inoculated (group 4). On day 8 after the induction of colitis, the mice 
were sacrificed and examined immunologically and histologically. 

Immunological testing of the sera showed that the treated mice did not 
5 show an immune response towards the expressed proteins. Serum was taken 
from the mice which were bled at day 8. This serum was analysed via Western 
blotting to check whether it contained antibodies against the proteins present in 
the medium fractions of the L lactis cells. The medium fractions used were 
derived from the L lactis strains MG1363 [pTREXI]; MG1363 [pTImTFFI] and 

10 MG1363 [pTIrTFFS], An equivalent of 1 ml of concentrated (phenol extraction 
and ethanol precipitation) medium fractions were analysed by SDS-polyacrylamide 
(20%) gel electrophoresis. After blotting to nitrocellulose filters, the filters were 
incubated for 1 hour with the serum solutions ©f the 4 groupS'Of mice. The serum 
was diluted SOO itimes in 20ml nitrocellulose blocking buffer* (Blotto: 100ml lOx 

15 PBS. 150mMMfcNa®l.::;.2ml*Triton.*X?.1O@^25g-*fat*free mtlkp.owder, water up to a 
total volume<©fH litefi))!»ASt!a seeondaiy»antib*©dy?tsheepnantisnn©use IgG coupled to 
horseradish peroxidase (HRPf was5^usedrUsipigs.the«*5O0ttimes diluted serum, no 
signal was deteetedr 

Hlst®logigal>analyslswwas performed -on colons of the-streated mice. The 

20 colons were cut in the length direction and divided in three equal portions: the 
distal (nearest to the anus), middle and proximal parts. These colon parts were 
analysed histologically after an overnight fixation In 3.7% formaldehyde (in PBS), 
followed by paraffin embedding, ensuring upright positioning of the tissue samples 
in the paraffin blocks. Of each tissue sample, three parallel Spm thick longitudinal 

25 sections, evenly spaced over the sample, were made. These crossections were 
coloured with hematoxylin/eosin. Histological analysis was-'performed In a blind 
fashion, meaning that the labels on the slides were covered- before scoring the 
sections. Slidesr carrying sections obtained froraf the several igroups of mice were 
randomized before microscopic examination. Each slide was then assigned a 

30 histological score (ranging from 0 to 5) according to the symptomatic description 
as defined in Table 1 . 

For each mouse and for each colon part, the average score of the three 
sections was calculated. In the distal and middle parts of the colon, the 
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inflammation consisting of epithelial damage and Infiltration were the most 
pronounced. In the proximal part, almost no inflammation could be observed. The 
average histological score was calculated for both the distal and the middle colon 
part per group of animals. The final histological sum score is the sum of the two 
5 separate scores (sum score = score of epithelial damage + score of infiltration) 
and is a measure for the degree of the inflammation. The histological sum scores 
of the middle colon part and the distal colon part for each of the groups of mice is 
shown in Figures 3 and 4, respectively. 

From the histological scores as set out in Figure 3, it could be concluded 

10 that the decrease in inflammation of the middle part of the colon is exclusively due 
to the presence of the Lactococcus cells, independent from the production of 
trefoil peptides. The addition of L lactis cells caused an inflammation decrease of 
about 40%. This could imply that L. lactis cells have an effect on the intestinal 
microflora which is also at the basis of the inflammation. 

15 From the histological scores for the distal part of the colon as set out in 

Figure 4, it could be concluded that there is a clear decrease of inflammation upon 
inoculation of mice with L lactis cells producing trefoil peptides. Mice having 
received [pTImTFFI] transformed L. lactis cells show a significant reduction of the 
inflammation of more than 65%. Treatment with pT1rTFF3] L lactis transformed 

20 cells caused a reduction of inflammation of about 50%. 

As can be seen from Figure 4, the inflammatory infiltration and the 
epithelial damage in the distal part of the colon are significantly decreased 
following inoculation with recombinant L. lactis strains which secrete either mTFFI 
polypeptide or rTFF3 polypeptide. 

25 
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Score 


Epithelium damage 


Inflammatory infiltration 


0 


Normal morphology 


No infiltration 


1 


Loss of a few goblet cells 


Infiltration around the basis of the crypts 


2 


Widespread loss of goblet 
cells 


Infiltration which reaches the Lamina 
muscularis mucosae 


3 


Loss of crypts 


Extensive infiltration which reaches the 
Lamina muscularis mucosae and thiclcening 
of the mucosa with prominent oedema 


4 


Widespread loss of crypts 


Infiltration which reaches the Lamina 
submucosa 



Table 1 : Symptomatlc^descriptlon^^^ 

Inflammatory infiltratiom^includes infiltration^of^the granulocytes, macrophages and 
lymphocytes. 
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CLAIMS 

1 . A recombinant micro-organism, preferably a bacterial strain, preferably a non- 
pathogenic strain, preferably a non-invasive strain, preferably a food grade 
strain, preferably a gram-positive bacterial strain, delivering a trefoil peptide in 
vivo. 

2. A micro-organism according to claim 1, wherein said gram-positive bacterial 
strain is a Lactococcus or a Lactobacillus species, 

3. A micro-organism according to claim 1 or 2 wherein said trefoil peptide is an 
intestinal trefoil polypeptide (TFF3). 

4. A micro-organism according to claim 1 or 2 wherein said trefoil polypeptide is 
TFF1. 

5. Method for the delivery of trefoil peptide to the gastro-intestinal tract 
comprising the administration of a micro-organism according to claims 1 to 4, 

6. Pharmaceutical composition comprising a micro-organism according to any of 
claims 1 to 4. 

7. Method of treatment of all types of gastric and intestinal diseases and 
disorders comprising the administration of a micro-organism according to any 
of claims 1 to 4. 

8. Use of a micro-organism according to any of claims 1 to 4 for the preparation 
of a medicament for treatment of acute gastro-intestinal inflammatory 
diseases, such as acute colitis, including but not limited to acute flare-ups of 
Crohn's disease and ulcerative colitis. 

9. Method for producing a micro-organism according to any of claims 1 to 4 
comprising transforming a micro-organism with a recombinant vector carrying 
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a trefoil polypeptide coding sequence under the control of a suitable promoter 
and a suitable secretion signal sequence. 

10. Recombinant vector comprising .a trefoil peptide coding sequence under the 
control of a suitable promoter sequence and a suitable secretion signal 
sequence. 



11. Recombinant vector according to claim 10, having a nucleotide sequence as 
presented in any of the figures lb, 1c, Id and 1e. 
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ABSTRACT 

The present invention relates to a micro-organism, preferably a bacterial 
strain, preferably a non-pathogenic strain, preferably a non-invasive strain, 
5 preferably a food grade strain, preferably a gram-positive bacterial strain, 
delivering a trefoil peptide in vivo. Preferably said trefoil peptide is an intestinal 
trefoil polypeptide (TFF3) or TFF1. The present invention further relates to a 
method for the delivery of trefoil peptide to the gastro-intestinal tract comprising 
the administration of such a bacterial strain. The present invention also relates to 

10 a pharmaceutical composition comprising a trefoil peptide delivering bacterium as 
well as methods of treatment of acute gastro-intestinal inflammatory diseases 
comprising administration of said transformed bacterial strains, particularly for 
treating acute colitis, including but not limited to acute flare-ups of Crohn's disease 
and ulcerative colitis in humans, as well as for treating gastro-intestinal disorders 

15 of a similar nature in other animal species. 



Figure lb : 
pL2rTFF3 

GAATTCGAGCTCGGTACCCGGGGATCTCGATCCCGCGAAATTAATACGACTC 
ACTATAGGGAGACCACAACGGTTTCCCTCTAGAAATAATTTTGTTTAACTTT 



TACAGTCATACTTTCTGCTGCAGCCCCGTTGTCAGGTGTTTACGCCCAGGAA 
TTTGTTGGCCTATCTCCAAGGCAATGTATGGCTCCAACAAATGTCAGGGTGG 
ACTGTAACTACCCCACTGTCACATCAGAGGAGTGTAACAACCGTGGTTGCTG 
TTTTGA€-TCCAGCATCCCAAATGTGCCCTGGTGCTTCAAACCTCTGCAAGAG 
ACAGAATGTACATTTTAACTAGTAGATCCGGCTGCTAACAAAGCCCGAAAGG 
AAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATAACTAGCATAACCCCTTG6 
GGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACTATATCC 
GGATGACCTGCAGGCATGCAAGCTTGGCACTGGCCGTCGTTTTACAACGTC6 
TGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCC 
CCTTTCGCCAGCT6ATTTCACTTTTTGCATTCTACAAACTGCATAACTCATA 
TGTAAATCGCTCCTTTTTAGGTGGCACAAATGTGAGGCATTTTCGCTCTTTC 
CGGCGAGGCTAGTTACCCTTAAGTTATTGGTATGACTGGTTTTAAGCGCAAA 
AAAAGTTGCTTTTTCGTACCTATTAATGTATCGTTTTAGAAAACCGACTGTA 
AAAAGTACAGTCGGCATTATCTCATATTATAAAAGCCAGTCATTAGGCCTAT 
CTGACAATTCCTGAATAGAGTTCATAAACAATCCTGCATGATAACCATCACA 
AACAGAATQATGTAeCTGTAAAGATAGCGGTAAATATATTGAATTACCTTTA 
TTAATGAATTTTCCTGC.TGTAATAATGGGTAGAAGG3?AATTACTATTATTAT 



ATAAAGTSGCTCTAAeTTATCCCAATAACCTAACTCTCCGTeGCTATTGTAA 
CCAGTTCTAAAAGCTGTATTTGAGTTTATCACCCTTGTCACTAAGAAAATAA 



AATATCAATTTCTGTGGTTATACTAAAAGTCGTTTGTTGGTTCAAATAATGA 
TTAAATATCTCTTTTCTCTTCCAATTGTCTAAATCJUVTTTTATTAAAGTTCA 
TTTGATATGCCTCCTAAATTTTTATCTAAAGTGAATTTAGGAGGCTTACTTG 
TCTGCTTTCTTCATTAGAATCAATCCTTTTTTAAAAGTCAATATTACTGTAA 
CATAAATATATATTTTAAAAATATCCCACTTTATCCAATTTTCGTTTGTTGA 
ACTAATGGGTGCTTTAGTTGAAGAATAAAGACCACATTAAAAAATGTGGTCT 
TTTGTGTTTTTTTAAAGGATTTGAGCGTAGCGAAAAATCCTTTTCTTTCTTA 
TCTTGATAATAAGGGTAACTATTGCCGGGATAGACTGTAACATTCTCACGCA 
TAAAATCCCCTTTCATTTTCTAATGTAAATCTATTACCTTATTATTAATTCA 
ATTCGCTCATAATTAATCCTTTTTCTTATTACGCAAAATGGCCCGATTTAAG 
CACACCCTTTATTCCGTTAATGCGCCATGACAGCCATGATAATTACTAATAC 



TCGTATAACAGATGCGATGATGCAGACCAATCAACATGGCACCTGCCATTGC 
TACCTGTACAGTCAAGGATGGTAGAAATGTTGTCGGTCCTTGCACACGAATA 
TTACGCCATTTGCCTGCATATTCAAACAGCTCTTCTACGATAAGGGCACAAA 
TCGCATCGTGGAACGTTTGGGCTTCTACCGATTTAGCAGTTTGATACACTTT 
CTCTAAGTATCCACCTGAATCATAAATCGGCAAAATAGAGAAAAATTGACCA 
TGTGTAAGCGGCCAATCTGATTCCACCTGAGATGCATAATCTAGTAGAATCT 



Figure lb - continued - 

CTTCGCTATCAAAATTCACTTCCACCTTCCACTCACCGGTTGTCCATTCATG 

GCTGAACTCTGCTTCCTCTGTTGACATGACACACATCATCTCAATATCCGAA 

TAGGGCCCATCAGTCTGACGACCAAGAGAGCCATAAACACCAATAGCCTTAA 

CATCATCCCCATATTTATCCAATATTCGTTCCTTAATTTCATGAACAATCTT 

CATTCTTTCTTCTCTAGTCATTATTATTGGTCCATTCACTATTCTCATTCCC 

TTTTCAGATAATTTTAGATTTGCTTTTCTAAATAAGAATATTTGGAGAGCAC 

CGTTCTTATTCAGCTATTAATAACTCGTCTTCCTAAGCATCCTTCAATCCTT 

TTAATAACAATTATAGCATCTAATCTTCAACAAACTGGCCCGTTTGTTGAAC 

TACTCTTTAATAAAATAATTTTTCCGTTCCCAATTCCACATTGCAATAATAG 

AAAATCCATCTTCATCGGCTTTTTCGTCATCATCTGTATGAATCAAATCGCC 

TTCTTCTGTGTCATCAAGGTTTAATTTTTTATGTATTTCTTTTAACAAACCA 

CCATAGGAGATTAACCTTTTACGGTGTAAACCTTCCTCCAAATCAGACAAAC 

GTTTCAAATTCTTTTCTTCATCATCGGTCATAAAATCCGTATCCTTTACAGG 

ATATTTTGCAGTTTCGTCAATTGCCGATTGTATATCCGATTTATATTTATTT 

TTCGGTATTTTTTATTAAAACGTCTCAAAATCGTTTCTGGGACGTTTTAGCG 

TTTATTTCGTTTAGTTATCGGCATAATCGTTAAAACAGGCGTTATCGTAGCG 

TAAAAGCCCTTGAGCGTAGCGTGCTTTGCAGCGAAGATGTTGTCTGTTAGAT 

TATGAAAGCCGATGACTGAATGAAATAATAAGCGCAGCGTCCTTCTATTTCG 

GTTGGAGGAGGCTCAAGGGAGTTTGAGGGAATGAAATTCCCTCATGGGTTTG 

ATTTTAAAAATTGCTTGCAATTTTGCCGAGCGGTAGCGCTGGAAAATTTTTG 

AAAAAAATTTGGAATTTGGAAAAAAATGGGGGGAAAGGAAGCGAATTTTGCT 

TCCGTACTACGACCCCCCATTAAGTGCCGAGTGCCAATTTTTGTGCCAAAAA 

CGCTCTATCCCAACTGGCTCAAGGGTTTGAGGGGTTTTTCAATCGCCAACGA 

ATCGCCAACGTTTTCGCCAACGTTTTTTATAAATCTATATTTAAGTAGCTTT 

ATTGTTGTTTTTATGATTACAAAGTGATACACTAATTTTATAAAATTATTTG 

ATTGGAGTTTTTTAAATGGTGATTTCAGAATCGAAAAAAAGAGTTATGATTT 

CTCTGACAAAAGAGCAAGATAAAAAATTAACAGATATGGCGAAACAAAAAGG 

TTTTTCAAAATCTGCGGTTGCGGCGTTAGCTATAGAAGAATATGCAAGAAAG 

GAATCAGAACAAAAAAAATAAGCGAAAGCTCGCGTTTTTAGAAGGATACGAG 

TTTTCGCTACTTGTTTTTGATAAGGTAATATATCATG'GCTATTAAAAATACT 

AAAGCTAGAAATTTTGGATTTTTATTATATCCTGACTCAATTCCTAATGATT 

GGAAAGAAAAATTAGAGAGTTTGGGCGTATCTATGGCTGTCAGTCCTTTACA 

CGATATGGACGAAAAAAAAGATAAAGATACATGGAATAGTAGTGATGTTATA 

CGAAATGGAAAGCACTATAAAAAACCACACTATCACGTTATATATATTGCAC 

GAAATCCTGTAACAATAGAAAGCGTTAGGAACAAGATTAAGCGAAAATTGGG 

GAATAGTTCAGTTGCTCATGTTGAGATACTTGATTATATCAAAGGTTCATAT 

GAATATTTGACTCATGAATCAAAGGACGCTATTGCTAAGAATAAACATATAT 

ACGACAAAAAAGATATTTTGAACATTAATGATTTTGATATTGACCGCTATAT 

AACACTTGATGAAAGCCAAAAAAGAGAATTGAAGAATTTACTTTTAGATATA 

GTGGATGACTATAATTTGGTAAATACAAAAGATTTAATGGCTTTTATTCGCC 

TTAGGGGAGCGGAGTTTGGAATTTTAAATACGAATGATGTAAAAGATATTGT 

TTCAACAAACTCTAGCGCCTTTAGATTATGGTTTGAGGGCAATTATCAGTGT 

GGATATAGAGCAAGTTATGCAAAGGTTCTTGATGCTGAAACGGGGGAAATAA 

AATGACAAACAAAGAAAAAGAGTTATTTGCTGAAAATGAGGAATTAAAAAAA 

GAAATTAAGGACTTAAAAGAGCGTATTGAAAGATACAGAGAAATGGAAGTTG 

AATTAAGTACAACAATAGATTTATTGAGAGGAGGGATTATTGAATAAATAAA 

AGCCCCCCTGACGAAAGTCGCGACTTCGTTCTTTTTTTACCTCTCGGTTATG 

AGTTAGTTCAAATTCGTTCTTTTTAGGTTCTAAATCGTGTTTTTCTTGGAAT 

TGTGCTGTTTTATCCTTTACCTTGTCTACAAACCCCTTAAAAACGTTTTTAA 

AGGCTTTTAAGCCGTCTGTACGTTCCTTAAG 



Figure 1c: 

pL2mTFF1v1 

GAATTCGAGCTCGGTACCCGGGGATCTCGATCCCGCGAAATTAATACGACTC 
ACTATAGGGAGACCACAACGGTTTCCCTCTAGAAATAATTTTGTTTAACTTT 
AAGAAGGAGATATACATATGAAAAAAAAGATTATCTCA6GTATTTTAATGTC 
TACAGTCATACTTTCTGCTGCAGCCCCGTTGTCAGGTGTTTAGGCCCAGGCC 
CAGGCCCAGGCCCAGGAAGAAACATGTATCATGGCCCCGCGGGAGAGGATAA 
ATTGTGGCTTCCCCGGTGTCACCGCCCAGCAGTGCACGGAGAGAGGTTGCTG 
TTTTGATGACAGTGTCCGGGGATTCCCGTGGTGCTTCCAGCCCATGGGCATC 
GAGAACACTCAAGAAGAAGAATGTCCCTTCTAACTAGTAGATCCGGCTGCTA 
ACAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATAACT 
AGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAA 
GGAGGAACTATATCCGGATGACCTGCAGGCATGCAAGCTTGGCACTGGCCGT 
CGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGC 
CTTGCAGCACATCCCCCTTTCGCCAGCTGATTTCACTTTTTGCATTCTACAA 
ACTGCATAACTCATATGTAAATCGCTCCTTTTTAGGTGGCACAAATGTGAGG 
CATTTTCGCTCTTTCCGGCGAGGCTAGTTACCCTTAAGTTATTGGTAT6ACT 
GGTTTTAAGCGCAAAAAAAGTTGCTTTTTCGTACCTATTAATGTATCGTTTT 
AGAAAACCGACTGTAAAAAGTACAGTCGGCATTATCTCATATTATAAAAGCC 



ATGATAACCATCAGAAACAGAATGATGTAGCTGTAAAGATAGCGGTAAATAT 



TGGTTCAAATAATGATTAAATATCTCTTTTCTCTTCCAATTGTCTAAATCAA 
TTTTATTAAAGTTCATTTGATATGCCTCCTAAATTTTTATCTAAAGTGAATT 
TAGGAGGCTTACTTGTCTGCTTTCTTCATTAGAATCAATCCTTTTTTAAAAG 
TCAATATTACTGTAACATAAATATATATTTTAAAAATATCCCACTTTATCCA 
ATTTTCGTTTGTTGAACTAATGGGTGCTTTAGTTGAAGAATAAAGACCACAT 
TAAAAAATGTGGTCTTTTGTGTTTTTTTAAAGGATTTGAGCGTAGCGAAAAA 
TCCTTTTCTTTCTTATCTTGATAATAAGGGTAACTATTGCCGGGATAGACTG 
TAACATTCTCACGCATAAAATCCCCTTTCATTTTCTAATGTAAATCTATTAC 
CTTATTATTAATTCAATTCGCTCATAATTAATCCTTTTTCTTATTACGCAAA 
ATGGCCCGATTTAAGCACACCCTTTATTCCGTTAATGCGCCATGACAGCCAT 



[©KAg'ATCTGATTGeTTJ^eTGGTfl 
AGACCGAAGCGCTCGTCGTATAACAGATGCGATGATGCAGACCAATCAACAT 
GGCACCTGCCATTGCTACCTGTACAGTCAAGGATGGTAGAAATGTTGTCGGT 
CCTTGCACACGAATATTACGCCATTTGCCTGCATATTCAAACAGCTCTTCTA 
CGATAAGGGCACAAATCGCATCGTGGAACGTTTGGGCTTCTACCGATTTAGC 
AGTTTGATACACTTTCTCTAAGTATCCACCTGAATCATAAATCGGCAAAATA 
GAGAAAAATTGACCATGTGTAAGCGGCCAATCTGATTCCACCTGAGATGCAT 
AATCTAGTAGAATCTCTTCGCTATCAAAATTCACTTCCACCTTCCACTCACC 
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GGTTGTCCATTCATGGCTGAACTCTGCTTCCTCTGTTGACATGACACACATC 

ATCTCAATATCCGAATAGGGCCCATCAGTCTGACGACCAAGAGAGCCATAAA 

CACCAATAGCCTTAACATCATCCCCATATTTATCCAATATTCGTTCCTTAAT 

TTCATGAACAATCTTCATTCTTTCTTCTCTAGTCATTATTATTGGTCCATTC 

ACTATTCTCATTCCCTTTTCAGATAATTTTAGATTTGCTTTTCTAAATAAGA 

ATATTTGGAGAGCACCGTTCTTATTCAGCTATTAATAACTCGTCTTCCTAAG 

CATCCTTCAATCCTTTTAATAACAATTATAGCATCTAATCTTCAACAAACTG 

GCCCGTTTGTTGAACTACTCTTTAATAAAATAATTTTTCCGTTCCCAATTCC 

ACATTGCAATAATAGAAAATCCATCTTCATCGGCTTTTTCGTCATCATCTGT 

ATGAATCAAATCGCCTTCTTCTGTGTCATCAAGGTTTAATTTTTTATGTATT 

TCTTTTAACAAACCACCATAGGAGATTAACCTTTTACGGTGTAAACCTTCCT 

CCAAATCAGACAAACGTTTCAi^ATTCTTTTCTTCATCATCGGTCATAAAATC 

CGTATCCTTTACAGGATATTTTGCAGTTTCGTCAATTGCCGATTGTATATCC 

GATTTATATTTATTTTTCGGTATTTTTTATTAAAACGTCTCAAAATCGTTTC 

TGGGACGTTTTAGCGTTTATTTCGTTTAGTTATCGGCATAATCGTTAAAACA 

GGCGTTATCGTAGCGTAAAAGCCCTTGAGCGTAGCGTGCTTTGCAGCGAAGA 

TGTTGTCTGTTAGATTATGAAAGCCGATGACTGAATGAAATAATAAGCGCAG 

CGTCCTTCTATTTCGGTTGGAGGAGGCTCAAGGGAGTTTGAGGGAATGAAAT 

TCCCTCATGGGTTTGATTTTAAAAATTGCTTGCAATTTTGCCGAGCGGTAGC 

GCTGGAAAATTTTTGAAAAAAATTTGGAATTTGGAAAAAAATGGGGGGAAAG ' 

GAAGCGAATTTTGCTTCCGTACTACGACCCCCCATTAAGTGCCGAGTGCCAA 

TTTTTGTGCCAAAAACGCTCTATCCCAACTGGCTCAAGGGTTTGAGGGGTTT : 

TTCAATCGCCAACGAATCGCCAACGTTTTCGCCAACGTTTTTTATAAATCTA 

TATTTAA6TAGCTTTATTGTTGTTTTTATGATTACAAAGTGATACACTAATT . 

TTATAAAATTATTTGATTGGAGTTTTTTAAATGGTGATTTCAGAATCGAAAA 

AAAGAGTTATGATTTCTCTGACAAAAGAGCAAGATAAAAAATTAACAGATAT 

GGCGAAACAAAAAGGTTTTTCAAAATCTGCGGTTGCGGCGTTAGCTATAGAA 

GAATATGCAAGAAAGGAATCAGAACAAAAAAAATT^GCGAAAGCTCGCGTTT 

TTAGAAGGATACGAGTTTTCGCTACTTGTTTTTGATAAGGTAATATATCATG 

GCTAT^TAAAAATACTAAAGCTAGT^AATTTTGGATTTTTATTATATCCTGACT 

CAATTCCTAATGATTGGAAAGAAAAATTAGAGAGTTTGGGCGTATCTATGGC 

TGTCAGTCCTTTACACGATATGGACGAAAAAAAAGATAAAGATACATGGAAT 

AGTAGTGATGTTATACGAAATGGAAAGCACTATAAAAAACCACACTATCACG 

TTATATATATTGCACGAAATCCTGTAACAATAGAAAGCGTTAGGAACAAGAT : 

TAAGCGAAAATTGGGGAATAGTTCAGTTGCTCATGTTGAGATACTTGATTAT 

ATCAAAGGTTCATATGAATATTTGACTCATGAATCAAAGGACGCTATTGCTA 

AGAATAAACATATATACGACAAAAAAGATATTTTGAACATTAATGATTTTGA 

TATTGACCGCTATATAACACTTGATGAAAGCCAAAAAAGAGAATTGAAGAAT 

TTACTTTTAGATATAGTGGATGACTATAATTTGGTAAATACAAAAGATTTAA 

TGGCTTTTATTCGCCTTAGGGGAGCGGAGTTTGGAATTTTAAATACGAATGA 

TGTAAAAGATATTGTTTCAACAAACTCTAGCGCCTTTAGATTATGGTTTGAG 

GGCAATTATCAGTGTGGATATAGAGCAAGTTATGCAAAGGTTCTTGATGCT6 

AAACGGGGGAAATAAAATGACAAACAAAGAAAAAGAGTTATTTGCTGAAAAT 

GAGGAATTAAAAAAAGAAATTAAGGACTTAAAAGAGCGTATTGAAAGATACA 

GAGAAATGGAAGTTGAATTAAGTACAACAATAGATTTATTGAGAGGAGG6AT 

TATTGAATAAATAAAAGCCCCCCTGACGAAAGTCGCGACTTCGTTCTTTTTT 

TACCTCTCGGTTATGAGTTAGTTCAAATTCGTTCTTTTTAGGTTCTAAATCG 

TGTTTTTCTTGGAATTGTGCTGTTTTATCCTTTACCTTGTCTACAAACCCCT 

TAAAAACGTTTTTAAAGGCTTTTAAGCCGTCTGTACGTTCCTTAAG 



Figure 1d: 
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GAATTCGATTAAGTCATCTTACCTCTTTTATTAGTTTTTTCTTATAATCTAA 
TGATAACATTTTTATAATTAATCTATAAACCATATCCCTCTTTGGAATCAAA 



GAGACCAGAACGGTTTCCCACTAGAAATAATTTTGTTTAACTTTAGAAAGGA 
GATATACGCATGAAAAAAAAGATTATCTCAGCTATTTTAATGTCTAGAGTCA 
TACTTTCTGCTGCAGCCCCGTTGTCAGGTGTTTACGCCCAGGAATTTGTTGG 
CCTATGTeCAAGGGAATGTATGGCTCCAAGAAATGTCAGGGTGGACTGTAAC 
TACCCCACTGTCACATCAGAGCAGTGTAACAACCGTGGTTGCTGTTTTGACT 
CCAGCATCCCAAATGTGCCCTGGTGCTTCAAACCTCTGCAAGAGACAGAATG 
TACATTTTAACTAGTAGATCCGGCTGCTAACAAAGCCCGAAAGGAAGCTGAG 
TTGGCTGCTGCCACCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTA 
AACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACTATATCCGGATGACC 
TGCAGGCAAGCTCTAGAATCGATACGATTTTGAAGTGGCAACAGATAAAAAA 
AAGCAGTTTAAAATTGTTGCTGAACTTTTAAAACAAGCAAATACAATCATT6 
TCGCAACAGATAGCGACAGAGAAGGCGAAAACATTGCCTGGTCGATCATTCA 
TAAAGCAAATGCCTTTTCTAAAGATAAAACGTATAAAAGACTATGGATCAAT 
AGTTTAGAAAAAGATGTGATCCGTAGCGGTTTTCAAAATTTGCAACCAGGAA 



UTA'iEGGAAAaiJAA^ AC 

iSAAGAGG^i^fi 
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TATAAAATTGTTGGGAGTATTCCTTACCATTTAAGCACACAAATTATTAAAA 
AAGTGGTTTTTGAAAGCCATGCGTCTGACATCTATCTGATTGTTGAAGAAGG 
ATTCTACAAGCGTACCTTGGATATTCACCGAACACTAGGGTTGCTCTTGCAC 
ACTCAAGTCTCGATTCAGCAATTGCTTAAGCTGCCAGCGGAATGCTTTCATC 
CTAAACCAAAAGTAAACAGTGTCTTAATAAAACTTACCCGCCATACCACAGA 
TGTTCCAGATAAATATTGGAAGCTATATACGTACTTTGTTTCAAAATGGGTC 
AATCGAGAATATCGTCAACTGTTTACTAAAAATCAGTTTCATCAAGCAATGA 
AACACGCCAAAGTAAACAATTTAAGTACCGTTACTTATGAGCAAGTATTGTC 
TATTTTTAATAGTTATCTATTATTTAACGGGAGGAAATAATTCTATGAGTCG 
CTTTTGTAAATTTGG AAAGTTAC AC GTTAC TAAAGGGAATGTAGATAAATTA 

GAACGTGCTCTCCAAGACCAGTTACAAAGAGCTAGTGCACTAAACATAATTA 
TTAACGCTATAAGTGTGTGGAACACTGTATATATGGAAAAAGCCGTAGAAGA 
ATTAAAAGCAAGAGGAGAATTTAGAGAAGATTTAATGCCATATGCGTGGCCG 
TTAGGATGGGAACATATCAATTTTCTTGGAGAATACAAATTTGAAGGATTAC 
ATGACACTGGGCAAATGAATTTACGTCCTTTACGTATAAAAGAGCCGTTTTA 
TTCTTAATATAACGGCTCTTTTTATAGAAAAAATCCTTAGCGTGGTTTTTTT 
CCGAAATGCTGGCGGTACCCCAAGAATTAGAAATGAGTAGATCAAATTATTC 
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ACGAATAGAATCAGGAAAATCAGATCCAACCATAAAAACACTAGAACAAATT 

GCAAAGTTAACTAACTCAACGCTAGTAGTGGATTTAATCCCAAATGAGCCAA 

CAGAACCAGAGCCAGAAACAGAATCAGAACAAGTAACATTGGATTTAGAAAT 

GGAAGAAGAAAAAAGCAATGACTTCGTGTGAATAATGCACGAAATCGTTGCT 

TATTTTTTTTTAAAAGCGGTATACTAGATATAACGAAACAACGAACTGAATA 

GAAACGAAAAAAGAGCCATGACACATTTATAAAATGTTTGACGACATTTTAT 

AAATGCATAGCCCGATAAGATTGCCAAACCAACGCTTATCAGTTAGTCAGAT 

GAACTCTTCCCTCGTAAGAAGTTATTTAATTAACTTTGTTTGAAGACGGTAT- 

ATAACCGTACTATCATTATATAGGGAAATCAGAGAGTTTTCAAGTATCTAAG 

CTACTGAATTTAAGAATTGTTAAGCAATCAATCGGAAATCGTTTGATTGCTT 

TTTTTGTATTCATTTATAGAAGGTGGAGTTTGTATGAATCATGATGAATGTA 

AAACTTATATAAAAAATAGTTTATTGGAGATAAGAAAATTAGCAAATATCTA 

TACACTAGAAACGTTTAAGAAAGAGTTAGAAAAGAGAAATATCTACTTAGAA 

ACAAAATCAGATAAGTATTTTTCTTCGGAGGGGGAAGATTATATATATAAGT 

TAATAGAAAATAACAAAATAATTTATTCGATTAGTGGAAAAAAATTGACTTA 

TAAAGGAAAAAAATCTTTTTCAAAACATGCAATATTGAAACAGTTGAATGAA 

AAAGCAAACCAAGTTAATTAAACAACCTATTTTATAGGATTTATAGGAAAGG 

AGAACAGCTGAATGAATATCCCTTTTGTTGTAGAAACTGTGCTTCATGACG6 

CTTGTTAAAGTACAAATTTAAAAATAGTAAAATTCGCTCAATCACTACCAAG 

CCAGGTAAAAGCAAAGGGGCTATTTTTGCGTATCGCTCAAAATCAAGCATGA 

TTGGCGGTCGTGGTGTTGTTCTGACTTCCGAGGAAGCGATTCAAGAAAATCA 

AGATACATTTACACATTGGACACCCAACGTTTATCGTTATGGAACGTATGCA 

GACGAAAACCGTTCATACACGAAAGGACATTCTGAAAACAATTTAAGACAAA 

TCAATACCTTCTTTATTGATTTTGATATTCACACGGCAAAAGAAACTATTTC 

AGCAAGCGATATTTTAACAACCGCTATTGATTTAGGTTTTATGCCTACTATG 

ATTATCAAATCTGATAAAGGTTATCAAGCATATTTTGTTTTAGAAACGCCAG 

TCTATGTGACTTCAAAATCAGAATTTAAATCTGTCAAAGCAGCCAAAATAAT 

TTCGCAAAATATCCGAGAATATTTTGGAAAGTCTTTGCCAGTTGATCTAACG 

TGTAATCATTTTGGTATTGCTCGCATACCAAGAACGGACAATGTAGAATTTT 

TTGATCCTAATTACCGTTATTCTTTCAAAGAATGGCAAGATTGGTCTTTCAA 

ACAAACAGATAATAAGGGCTTTACTCGTTCAA6TCTAACGGTTTTAAGCGGT 

ACAGAAGGCAAAAAACAAGTAGATGAACCCTGGTTTAATCTCTTATTGCACG 

AAACGAAATTTTCAGGAGAAAAGGGTTTAATAGGGCGTAATAACGTCATGTT 

TACCCTCTCTTTAGCCTACTTTAGTTCAGGCTATTCAATCGAAACGTGCGAA 

TATAATATGTTTGAGTTTAATAATCGATTAGATCAACCCTTAGAAGAAAAAG 

AAGTAATCAAAATTGTTAGAAGTGCCTATTCAGAAAACTATCAAGGGGCTAA 

TAGGGAATACATTACCATTCTTTGCAAAGCTTGGGTATCAAGTGATTTAACC 

AGTAAAGATTTATTTGTCCGTCAAGGGTGGTTTAAATTCAAGAAAAAAAGAA 

GCGAACGTCAACGTGTTCATTTGTCAGAATGGAAAGAAGATTTAATGGCTTA 

TATTAGCGAAAAAAGCGATGTATACAAGCCTTATTTAGTGACGACCAAAAAA 

GAGATTAGAGAAGTGCTAGGCATTCCTGAACGGACATTAGATAAATTGCTGA 

AGGTACTGAAGGCGAATCAGGAAATTTTCTTTAAGATTAAACCAGGAAGAAA 

TGGTGGCATTCAACTTGCTAGTGTTAAATCATTGTTGCTATCGATCATTAAA 

GTAAAAAAAGAAGAAAAAGAAAGCTATATAAAGGCGCTGACAAATTCTTTTG 

ACTTAGAGCATACATTCATTCAAGAGACTTTAAACAAGCTAGCAGAACGCCC 

TAAAACGGACACACAACTCGATTTGTTTAGCTATGATACAGGCTGAAAATAA 

AACCCGCACTATGCCATTACATTTATATCTATGATACGTGTTTGTTTTTTCT 

TTGCTGTTTAGCGAATGATTAGCAGAAATATACAGAGTAAGATTTTAATTAA 

TTATTAGGGGGAGAAGGAGAGAGTAGCCCGAAAACTTTTAGTTGGCTTG6AC 

TGAACGAAGTGAGGGAAAGGCTACTAAAACGTCGAGGGGCAGTGAGAGCGAA 

GCGAACACTTGATTTTTTAATTTTCTATCTTTTATAGGTCATTAGAGTATAC 
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TTATTTGTCCTATAAACTATTTAGCAGCATAATAGATTTATTGAATAGGTCA 
TTTAAGTTGAGCATATTAGAGGAGGAAAATCTTGGAGAAATATTTGAAGAAC 
CCGATTACATGGATTGGATTAGTTCTTGTGGTTACGTGGTTTTTAACTAAAA 
GTAGTGAATTTTTGATTTTTGGTGTGTGTGTCTTGTTGTTAGTATTTGCTAG 
TCAAAGTGAa?TAAATA 




Figure 1e: 



pTImTFFI 

GAATTCGATTAAGTCATCTTACCTCTTTTATTAGTTTTTTCTTATAATCTAA 
TGATAACATTTTTATAATTAATCTATAAACCATATCCCTCTTTGGAATCAAA 
ATTTATTATCTACTCCTTTGTAGATATGTTATAATACAAGTATCAGATCTGG 
GAGACCACAACGGTTTCCCACTAGAAATAATTTTGTTTAACTTTAGAAAGGA 
GATATACGCATGAAAAAAAAGATTATCTCAGCTATTTTAATGTCTACAGTCA 
TACTTTCTGCTGCAGCCCCGTTGTCAGGTGTTTACGCCCAGGCCCAGGCCCA 
GGCCCAGGCCCAGGAAGAAACATGTATCATGGCCCCCCGGGAGAGGATAAAT 
TGTGGCTTCCCCGGTGTCACCGCCCAGCAGTGCACGGAGAGAGGTTGCTGTT 
TTGATGACAGTGTCCGGGGATTCCCGTGGTGCTTCCACCCCATGGCCATCGA 
GAACACTCAAGAAGAAGAATGTCCCTTCTAACTAGTAGATCCGGCTGCTAAC 
AAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATAACTAG 
CATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGG 
AGGAACTATATCCGGATGACCTGCAGGCAAGCTCTAGAATCGATACGATTTT 
GAAGTGGCAACAGATAAi\AAAAAGCAGTTTAAAATTGTTGCTGAACTTTTAA 
AACAAGCAAATACAATCATTGTCGCAACAGATAGCGACAGAGAAGGCGAAAA 
CATTGCCTGGTCGATCATTCATAAAGCAAATGCCTTTTCTAAAGATAAAACG 
TATAAAAGACTATGGATCAATAGTTTAGAAAAAGATGTGATCCGTAGCGGTT 
TTCAAAATTTGCAACCAGGAATGAATTACTATCCCTTTTATCAAGAAGCGCA 
AAAGAAAAACGAAATGATACACCAATCAGTGCAAAAAAAGATATAATGGGAG 
ATAAGACGGTTCGTGTTCGTGCTGACTTGCACCATATCATAAAAATCGAAAC 
AGCAAAGAATGGCGGAAACGTAAAAGAAGTTATGGAAATAAGACTTAGAAGC 
AAACTTAAGAGTGTGTTGATAGTGCAGTATCTTAAAATTTTGTATAATAGGA 
ATTGAAGTTAAATTAGATGCTAAAAATTTGTAATTAAGAAGGAGTGATTACA 
TGAACAAAAATATAAAATATTCTCAAAACTTTTTAACGAGTGAAAAAGTACT 
CAACCAAATAATAAAACAATTGAATTTAAAAGAAACCGATACCGTTTACGAA 
ATTGGAACAGGTAAAGGGCATTTAACGACGAAACTGGCTAAAATAAGTA7VAC 
AGGTAACGTCTATTGAATTAGACAGTCATCTATTCAACTTATCGTCAGT^AAA 
ATTAAAACTGAATACTCGTGTCACTTTAATTCACCAAGATATTCTACAGTTT 
CAATTCCCTAACAAACAGAGGTATAAAATTGTTGGGAGTATTCCTTACCATT 
TAAGCACACAAATTATTAAAAAAGTGGTTTTTGAAAGCCATGCGTCTGACAT 
CTATCTGATTGTTGAAGAAGGATTCTACAAGCGTACCTTGGATATTCACCGA 
ACACTAGGGTTGCTCTTGCACACTCAAGTCTCGATTCAGCAATTGCTTAAGC 
TGCCAGCGGAATGCTTTCATCCTAAACCAAAAGTAAACAGTGTCTTAATAAA 
ACTTACCCGCCATACCACAGATGTTCCAGATAAATATTGGAAGCTATATACG 
TACTTTGTTTCAAAATGGGTCAATCGAGAATATCGTCAACTGTTTACTAAAA 
ATCAGTTTCATCAAGCAATGAAACACGCCAAAGTAAACAATTTAAGTACCGT 
TACTTATGAGCAAGTATTGTCTATTTTTAATAGTTATCTATTATTTAACGGG 
AGGAAATAATTCTATGAGTCGCTTTTGTAAATTTGGAAAGTTACACGTTACT 
AAAGGGAATGTAGATAAATTATTAGGTATACTACTGACAGCTTCCAAGGAGC 
TAAAGAGGTCCCTAGCGCTCTTATCATGGGGAAGCTCGGATCATATGCAAGA 
CAAAATAAACTCGCAACAGCACTTGGAGAAATGGGACGAATCGAGAAAACCC 
TCTTTACGCTGGATTACATATCTAATAAAGCCGTAAGGAGACGGGTTCAAAA 
AGGTTTAAATAAAGGAGAAGCAATCAATGCATTAGCTAGAACTATATTTTTT 
GGACAACGTGGAGAATTTAGAGAACGTGCTCTCCAAGACCAGTTACAAAGAG 
CTAGTGCACTAAACATAATTATTAACGCTATAAGTGTGTGGAACACTGTATA 
TATGGAAAAAGCCGTAGAAGAATTAAAAGCAAGAGGAGAATTTAGAGAAGAT 
TTAATGCCATATGCGTGGCCGTTAGGATGGGAACATATCAATTTTCTTGGAG 
AATACAAATTTGAAGGATTACATGACACTGGGCAAATGAATTTACGTCCTTT 
ACGTATAAAAGAGCCGTTTTATTCTTAATATAACGGCTCTTTTTATAGAAAA 
AATCCTTAGCGTGGTTTTTTTCCGAAATGCTGGCGGTACCCCAAGAATTAGA 
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AATGAGTAGATCAAATTATTCACGAATAGAATCAGGAAAATCA6ATCCAACC 
ATAAAAACACTAGAACAAATTGCAAAGTTAACTAACTCAACGCTAGTAGTGG 
ATTTAATCCCAAATGAGCCAACAGAACCAGAGCCAGAAACAGAATCAGAACA 
AGTAACATTGGATTTAGAAATGGAAGAAGAAAAAAGCAATGAGTTeGTGTGA 
ATAATGCACGAAATCGTTGCTTATTTTTTTTTAAAAGCGGTATACTAGATAT 
AACGAAACAAGGAACTGAATAGAAACGAAAAAAGAGCCATGACACATTTATA 
AAATGTTTGACGACATTTTATAAATGCATAGCCCGATAAGATTGCCAAACCA 
ACGCTTATeAGTTAGTCAGATGAACTCTTCCCTCGTAAGAAGTTA'M'TAATT 
AACTTTGTTTGAAGACGGTATATAACCGTACTATCATTATATAGGGAAATCA 
GAGAGTTTTCAAGTATCTAAGCTACTGAATTTAAGAATTGTTAAGCAATCAA 
TCGGAAATCGTTTGATTGCTTTTTTTGTATTCATTTATAGAAGGTGGAGTTT 
GTATGAATCATGATGAATGTAAAACTTATATAAAAAATAGTTTATTGGAGAT 
AAGAAAATTAGCAAATATCTATACACTAGAAACGTTTAAGAAAGAGTTAGAA 
AAGAGAAATATCTACTTAGAAACAAAATCAGATAAGTATTTTTCTTCGGAGG 
GGGAAGATTATATATATAAGTTAATAGAAAATAACAAAATAATTTATTCGAT 
TAGTGGAAAAAAATTGACTTATAAAGGAAAAAAATCTTTTTCAAAACATGCA 
ATATTGAAACAGTTGAATGAAAAAGCAAACCAAGTTAATTAAACAACCTATT 
TTATAGGATTTATAGGAAAGGAGAACAGCTGAATGAATATCCCTTTTGTTGT 



ATTCGCTCAATCACTACCAAGGCAGGTAAAAGCAAAGGGGCTATTTTTGCGT 
ATCGCTeAAAATCAAGCATGATTGGeGG^CGTGGTGTTGTTCTGA@.TTCCGA 

pm ~ ■ ■" 



ATTTTGTTTTAGAAACGCCAGTCTATGTGACTTCAAAA!rCAGAAT,TTAAATC 



TCTTTGCCAGTTGATCTAACGTGTAATCATTTTGGTATTGCTCGCATACCAA 
GAACGGACAATGTAGAATTTTTTGATCCTAATTACCGTTATTCTTTCAAAGA 
ATGGCAAGATTGGTCTTTCAAACAAACAGATAATAAGGGCTTTACTCGTTCA 
AGTCTAACGGTTTTAAGCGGTACAGAAGGCAAAAAACAAGTAGATGAACCCT 
GGTTTAATCTCTTATTGCACGAAACGAAATTTTCAGGAGAAAAGGGTTTAAT 
AGGGCGTAATAACGTCATGTTTACCCTCTCTTTAGCCTACTTTAGTTCAGGC 
TATTCAATCGAAACGTGCGAATATAATATGTTT6AGTTTAATAATCGATTAG 
ATCAACCCTTAGAAGAAAAAGAAGTAATCAAAATTGTTAGAAGTGCCTATTC 
AGAAAACTATCAAGGGGCTT^TAGGGAATACATTACCATTCTTTGCAAAGCT 
TGGGTATCAAGTGATTTAACCAGTAAAGATTTATTTGTCCGTCAAGGGTGGT 
TTAAATTCAAGAAAAAAAGAAGCGAAGG-TCAACGTGTTCATTTGTCAGAATG 



AGGCGCTGACAAATTCTTTTGACTTAGAGCATACATTC^ATTCAAGAGACTTT 
AAACAAGCTAGCAGAACGCCCTAAAACGGACACACAACTCGATTTGTTTAGC 
TATGATACAGGCTGAT^AATAA/ACCCGCACTATGCCATTACATTTATATCTA 
TGATACGTGTTTGTTTTTTCTTTGCTGTTTAGCGAATGATTAGCAGAAATAT 
ACAGAGTAAGATTTTAATTAATTATTAGGGGGAGAAGGAGAGAGTAGCCCGA 
AAACTTTTAGTTGGCTTGGACTGAACGAAGTGAGGGAAAGGCTACTAAAACG 
TCGAGGGGCAGTGAGAGCGAAGCGAACACTTGATTTTTTAATTTTCTATCTT 
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TTATAGGTCATTAGAGTATACTTATTTGTCCTATAAACTATTTAGCAGCATA 
ATAGATTTATTGAATAGGTCATTTAAGTTGAGCATATTAGAGGAGGAAAATC 
TTGGAGAAATATTTGAAGAACCCGATTACATGGATTGGATTAGTTCTTGTGG 
TTACGTGGTTTTTAACTAAAAGTAGTGAATTTTTGATTTTTGGTGTGTGTGT 
CTTGTTGTTAGTATTTGCTAGTCAAAGTGATTAAATA 
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Figure 3 
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Figure 4 



